Open-Source Consumer-Grade Indic Text To Speech

نویسندگان

  • Andrew Wilkinson
  • Alok Parlikar
  • Sunayana Sitaram
  • Tim White
  • Alan W. Black
  • Suresh Bazaj
چکیده

Open-source text-to-speech (TTS) software has enabled the development of voices in multiple languages, including many high-resource languages, such as English and European languages. However, building voices for low-resource languages is still challenging. We describe the development of TTS systems for 12 Indian languages using the Festvox framework, for which we developed a common frontend for Indian languages. Voices for eight of these 12 languages are available for use with Flite, a lightweight, fast run-time synthesizer, and the Android Flite app available in the Google Play store. Recently, the baseline Punjabi TTS voice was built end-to-end in a month by two undergraduate students (without any prior knowledge of TTS) with help from two of the authors of this paper. The framework can be used to build a baseline Indic TTS voice in two weeks, once a text corpus is selected and a suitable native speaker is identified.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BATS: The Blind Audio Tactile Mapping System

The BATS project focuses on helping students with visual impairments access and explore spatial information using standard computer hardware and open source software. Our work is largely based on prior techniques used in presenting maps to the blind such as text-to-speech synthesis, auditory icons, and tactile feedback. We add spatial sound to position auditory icons and speech callouts in thre...

متن کامل

Acharya - A Text Editor and Framework for working with Indic Scripts

This paper discusses an open source project1 which provides a framework for working with Indian language scripts using a uniform syllable based text encoding scheme. It also discusses the design and implementation of a multi-platform text editor for 9 Indian languages which was built based on this encoding scheme.

متن کامل

Ultra High Video Data Compression for Android Devices Using OpenCV and other Open-Source Tools

We describe in this paper how to use open-source resources, in particular OpenCV, to design and implement an Android application that achieves ultra-high video compression for special videos, which consist of mainly a human face and speech, such as the scene of a news announcement or a teleconference. Google Voice Recognition[30], which is a free and open Android tool, is utilized to convert th...

متن کامل

The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion

Text-to-Speech (TTS) systems convert text into phonetic pronunciations which are then processed by Acoustic Models. TTS frontends typically include text processing, lexical lookup and Grapheme-to-Phoneme (g2p) conversion stages. This paper describes the design and implementation of the Indic frontend, which provides explicit support for many major Indian languages, along with a unified framewor...

متن کامل

Brahmi-Net: A transliteration and script conversion system for languages of the Indian subcontinent

We present Brahmi-Net an online system for transliteration and script conversion for all major Indian language pairs (306 pairs). The system covers 13 Indo-Aryan languages, 4 Dravidian languages and English. For training the transliteration systems, we mined parallel transliteration corpora from parallel translation corpora using an unsupervised method and trained statistical transliteration sy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016